Categorical Metadata Representation for Customized Text Classification

نویسندگان
چکیده

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Joint Semantic Vector Representation Model for Text Clustering and Classification

Text clustering and classification are two main tasks of text mining. Feature selection plays the key role in the quality of the clustering and classification results. Although word-based features such as term frequency-inverse document frequency (TF-IDF) vectors have been widely used in different applications, their shortcoming in capturing semantic concepts of text motivated researches to use...

متن کامل

Customized metadata for Internet information

Several search engines, catalogs, and ltering services aim to help users of the Internet deal with a growing information \overload". However, these tools typically are either generic in scope, or limited to the needs of a particular user without regard for reuse in some related context. We propose an approach and architecture for customized ltering and cataloging which bridges these two extreme...

متن کامل

Feature Selection and Representation in Text Classification

Text classification remains an important practical application of both modern machine learning (ML) and natural language processing (NLP) techniques. The influence of these disparate areas of research has contributed much to the success of current state of the art classification methods. This essay provides an overview of the field of text classification, and investigates in particular the topi...

متن کامل

Chapter 2 Text Representation and Classification Methods

Text representation and classification method is the most important research objectives of Text Classification. Text representation is prerequisite of Text Classification mainly because it decides the coding ways of text which directly affect classification performance. In this thesis, we have used statistic topic model for the purpose of reducing dimensionality and simultaneously representing ...

متن کامل

Document Vector Space Representation Model for Automatic Text Classification

Classification of text documents presents a unique challenge to conventional classification algorithms. Due to the existence of large number of features in the datasets, providing a desired representation for text documents can be seen as another problem. In this paper a simple but effective representation model for text documents to tackle the classification problem is discussed. Two different...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Transactions of the Association for Computational Linguistics

سال: 2019

ISSN: 2307-387X

DOI: 10.1162/tacl_a_00263